Dynamic Collections in Indri
نویسنده
چکیده
Text search engines have historically been designed for unchanging collections of documents. While this is fine for many applications, a growing number of important applications in news, finance, law and desktop search require indexes that can be efficiently updated. Previous research into supporting dynamic collections revolves around incremental methods. Incremental systems are optimized for adding large batches of documents to an existing index. These systems do not generally allow for queries to run while an incremental update is taking place. This work presents recent changes to the Indri search engine to support dynamic collections. Unlike previous incremental systems, Indri does not require large batch sizes to achieve efficient indexing performance. Indri is also designed to be as concurrent as possible, allowing queries to run while documents are added to the system.
منابع مشابه
Effects of anthropogenic disturbance on indri (Indri indri) health in Madagascar.
Anthropogenic habitat disturbance impairs ecosystem health by fragmenting forested areas, introducing environmental contamination, and reducing the quality of habitat resources. The effect of this disturbance on wildlife health is of particular concern in Madagascar, one of the world's biodiversity hotspots, where anthropogenic pressures on the environment remain high. Despite the conservation ...
متن کاملNot just a pretty song: an overview of the vocal repertoire of Indri indri.
The vocal behaviour of wild indris inhabiting the area near Andasibe was studied by means of all occurrence sampling. We provide a quantitative overview of the vocal repertoire of Indri indri, describing qualitative contextual information and quantitative acoustic analysis for all the utterances we recorded from adult individuals. Other than the song, the repertoire of Indri indri comprises 8 v...
متن کاملIndri TREC Notebook 2006: Lessons Learned From Three Terabyte Tracks
This report describes the lessons learned using the Indri search system during the 2004-2006 TREC Terabyte Tracks. We provide an overview of Indri, and, for the ad hoc and named page finding tasks, discuss our general approach to the problem, what worked, what did not work, and what could possibly work in the future.
متن کاملIndri: a language-model based search engine for complex queries
Search engines are a critical tool for intelligence analysis. A number of innovations for search have been introduced since research with an emphasis on analyst needs began in the TIPSTER project. For example, the Inquery search engine introduced support for specification of complex queries in a probabilistic inference network framework. Recent research on language model-ing has led to the deve...
متن کاملComplete Genome Sequence of Torque teno indri virus 1, a Novel Anellovirus in Blood from a Free-Living Lemur
We identified Torque teno indri virus 1 (TTIV1), the first anellovirus in a free-living lemur (Indri indri). The complete circular 2,572-nucleotide (nt) TTIV1 genome is distantly related to torque teno sus virus. Phylogenetic and sequence analyses support TTIV1 as a putative member of a new genus within the Anelloviridae family.
متن کامل